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Cn ■ ABSTRACT 

We present a sample of Cepheid variable stars towards M31 based on the first year of regular M31 
observations of the PS1 survey in the rpi and ipi filters. We describe the selection procedure for 
Cepheid variable stars from the overall variable source sample and develop an automatic classification 
c"| \ scheme using Fourier decomposition and the location of the instability strip. We find 1440 fundamental 

mode (classical 6) Cep stars, 126 Cepheids in the first overtone mode, and 147 belonging to the 
Population II types. 296 Cepheids could not be assigned to one of these classes and 354 Cepheids 
| were found in other surveys. These 2009 Cepheids constitute the largest Cepheid sample in M31 

known so far and the full catalog is presented in this paper. We briefly describe the properties of our 
sample in its spatial distribution throughout the M31 galaxy, in its age properties, and we derive an 
apparent period-luminosity relation (PLR) in our two bands. The Population I Cepheids nicely follow 
the dust pattern of the M31 disk, whereas the 147 Type II Cepheids are distributed throughout the 

■ halo of M31. We outline the time evolution of the star formation in the major ring found previously 
and find an age gradient. A comparison of our PLR to previous results indicates a curvature term in 

I the PLR. 

Subject headings: Catalogs - distance scale - Galaxies: individual (M31, NGC 224) - Galaxies: star 
q formation - Local Group - Stars: variables: Cepheids 

■ 1. INTRODUCTION 

Classical S Cep variable stars ar e young mass ive s tars of Population I in th eir post main sequence evolution. Due to 

■ their period- luminosity relation ()Leavittlll908l and lLeavitt fc Pickerindri912f ). they are the anchor of the extragalactic 
dista nce scale. As such, they are a central topic of many investigations in nearby galaxies (see e.g. iFreedman et all 
I2001L for a historical s ummary including the early HST projects, more recent work is discussed e .g. bvlRiess et al.ll2011l 
and TGerke et al.lfeoill ). S Cep stars can provide distance estimates out to several 10 Mpc (e.g. iRiess et al.ll2011l ). but 
the major uncertainty of these distance values is still related to the local calibration which until recently totally rested 

\Q , on the LMC distance. As there is an ongoing and unsolved debate to which extent the peri od-luminosity relation (PLR) 

" ■ of S Cep stars depends on stellar properties like e.g. the metallicity (Majaess etaDEoTJ), the LMC is not an optimal 

choice given it s signi ficantly reduced metallicity compared to the large spirals to which the PLR is nor mally applied 
Maiaess et alJ (I2011D summ arize the recent discussion with the conclusion that the period Wesen heit (lMadorelll97i 
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Madordll982l . lOpolskilll983h relation (from colors V and /) does not depend on metallicity while IShappee fc Stanel. 
|20l"lir based on a large sample of S Cepheid stars observed with HST in two fi elds of the spiral ga laxy M 101, conclude 
the opposite and in addition report supporting evidence from other studies (|Gerke et al.ll201lj) . On the theoretical 
side there have been predictions that the Wesenheit PLR should depend on metallicity if based on photometric bands 
sensitive to metal line blanketing (like the combination of Joh nson B and V") while the red band combinations of 
the Johnson-Cousins system should not show this dependence (jFiorentino et alJl2007t ). The discussion of the PLR 
properties - in individual photo metric bands - has become even more complicated as some authors reported a non- 
universal slope for the PLR (see Sandag e fc Tammannll200"8l and references therein), which has not been confirmed by 
others and still is under debate. 

The recent discussion on the calibration of the cosmological distance scale and the subsequent determination of the 
Hubble constant has somehow shadowed that these relatively young and massive stars can be also used as excellent 
tracers of the recent evolution of stellar populations as they allow number counts in precisely known time bins. 
The LMC has been the historical place for the establishment of the PLR (|Leavitt fc Pickerindll912l ) as it is close by 
and can be well resolved from the ground while already distant enough that the member stars are essentially at the 
same distance. Thus, the LMC (and SMC) were intensively studied for variable stars and transient phenomena by 
various projects yielding also extensive and well observed samples of Cepheid stars in fundamental, first and second 
overtone pulsation as well as Population II pulsators (W Vir, RV Tau etc). The most prominent example is probably 
the OGLE proje ct which provided a well established PLR based on B, V, and I observations of ~ 3400 6 Cep stars in 
LMC and SMC Ijtldalski et aJJIl999lUdalski et al.lll999hllUdalski et al.lfl999dY 
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The nearest example of a Cepheid population in a typical large spiral galaxy of roughly solar metalli city and outsid e 
our own Milky Way ga laxy is the one in the Andromeda galaxy M31. The first detections go back to Hubble (1929). 
IBaade fc Swopel ((19631 summarize the early observational results which already comprised a sample size of ~ 260 
Cepheid variables. 

Wide field CCD imagers time series studi es brought M 31 again into the foc us of several projects aiming to detect and 
stud y transient s and variables stars (e.g. lMacrill200 j the DIRECT project: iFliri et al.ll2006l IVilardell et al.ll2007t ). 
The lFliri et all ((200(1 sample (based on the WeCAPP micro lensing proiect ~(|Riffeser et al.l l200lh was observed over 
almost 8 years with ground based CCD imaging centered on the M31 bulge. Therefore, the S Cep stars as typical 
Population I objects are only a minor contribution (33) to the total sample of variable stars and they are even 
out numbered by other typ es of Cepheids (93). 

The IVilardell etail (|2007h sample was obtained from observations covering the north-west quarter of the M31 disk. 
Spanning almost 5 years, the ~ 250 observations in B and V allowed to obtain a sample of 416 Cepheid stars, to study 
their light curves in detail and to derive distance estimates to M31. A shortcoming of the DIRECT data is that the 250 
individual observations were obtained in 21 nights only. The window function (inherent to the observations) implies a 
strong bias against finding objects within certain period ranges. This can distort results like e.g. the period-number 
distribution of the sample. 

Ground-based M3 1 observations have been supplemented with single epoch high resolution HST-imaging (PHAT, 
Uohnson et al.l l2012f l . especially in the NIR. This allowed to improve on some of th e short-comings o f ground-based 
data for this galaxy and paved the way for a precise distance determination of M31 (jRiess et al.ll2012l ). 
In this paper we expand the studies of the Cepheid population in M3 1 with our wide field ground based data taken 
within the first year of the Pan-STARRS 1 project ((Kaiser et a l. 2002). Among others, the two major advantages of 
our data are that, firstly, they cover the complete M31 disk, and, secondly, they are much more evenly distributed in 
time using observations taken in up to 102 nights (~ 80% of the nights have two epochs, giving a total of 183 epochs). 
Throughout this paper we will refer to Fundamental mode (FM) Cepheids (classical 5 Cep variable stars), First 
overtone (FO) Cepheids (classical S Cep variable stars pulsating in the excited mode of the first overtone), Type II 
Cepheids (W Virginis) and use the term Cepheids for all three subgroups. BL Herculi stars are too faint to be detected 
with our survey and RV Tauri are excluded manually (c.f. Section [3]), so that our Type II Cepheids only consist of W 
Virginis stars. 

This paper is organized as follows: In Section[2]we present the PAndromeda survey, our observations in 2010 and 2011, 
and a description of our data reduction, including the determination of the light curves for the resolved sources. Our 
period determination is described in Section [3] The Cepheids detected in the PAndromeda data-set are presented in 
Section [U This Section also describes the automatic Cepheid detection and classification. An overview of the catalog 
is shown in Section[5j We discuss our results, such as (among others) the PLR, the spatial distribution of the Cepheids 
and the spatial age distribution, in Section HI We conclude the paper in Section [7] 

2. OBSERVATIONS & DATA REDUCTION 

2.1. PAndromeda 

The 1.8 m Panoramic Survey Telescope and Rapid Response System (Pan-STARRS, iHodapp et al. 2004, Kaise r et al.l 
2002) on Haleakala, Maui, be gan its regular survey in spring 2010. Pan-STARRS 1 (PS1) uses the Giga Pixel Camera 
(GPC, iTonrv fc Onakal 120091) which is currently' one of the largest cameras in the world. It consists of 60 OTAs 
(orthogonal transfer arrays), each of them segmented into an array of 8 x 8 cells. The total number of pixels is 1.4 • 10 9 , 
with a pixel size of 0.258 arcsec. The FOV of one OTA is 21' x 21'. The aims of PS1 are to map 37r of the sky in 
gpi, fpi, ipi, zpi, ypi bands. In addition to the large Sir area some fields are visited more frequently and have deeper 
exposure times. One of these so called medium deep fields targets the Andromeda galaxy (M31). With the ^7 deg 2 
field of view of Pan-STARRS, a complete monitoring of the disk and large parts of the halo is possible with one single 
exposure. The PAndromeda survey observes M31 for 0.5 h per night (including overhead) during a period of 5 months 
per year, equivalent to 2% of the overall PS1 time. We split the integration time into two visits per night (separated 
by 3 h ± 0.5 h), for the whole season^. During the first two seasons we focused on the filters rn and ipi (|Tonrv et al.l 
I2012D . Observing M31 two times per night wit h the rpi and ip i filters is a consequence of the PAndromeda survey 
goal to find microlensing events as described in iLee et all (|2012h . 

In the first season PAndromeda monitored M31 from 2010-07-23 to 2011-12-27. In this work we used in addition part 
of the second season from 2011-07-25 to 2011-08-12. This results in a gap of about half a year in the light curves and 
gives us a total of 183 epochfl 

2.2. Data reduction 

The details of the data reduction for PAndromeda are described in ILee et ail (|2012f) . In the following section we 
summarize the data reduction steps and point out differences to the approach taken in this work. 
The GPC1 exposures are de-biased, flat-fielded, and astrometrically calibrated using the Pan-STARRS Image Pro- 
cessing Pipeline (IPP; Magnicr [2003) . During this process the images are remapped to a common grid, the so called 
skycells. These warped images have a pixel scale of 0.200 arcsec/pixel, which is smaller than the natural pixel size of 

5 Per visit 7 exposures of 60s in the rpi band and 5 exposures of 60s in the ipi band are taken. The total integration time per night is 
therefore 1440 s. The remaining time is spent on readout, focusing and filter change. 

6 Due to masking the total number of epochs for a light curve can be less than 183. 
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the GPC1 (0.258 arcsec/pixel). For better dat a management we int egrated our pipeline ()Koppenhoefer et al.ll20lil) 
into the AstroWISE data management system ([Valentiin et al.l 120071 ). 

In this work we use only data from 24 skycells, namely 039-041, 051-054, 064-068, 076-079, 088-092, 102-104. As shown 
in Fig. 12. 2\ the skycell layout is not optimized in the sense that it does not follow the detector boundaries. We plan 
to optimize the skycell layout and use an intrinsic pixel scale close to 0.258 arcsec/pixel in the future. The following 
data reduction steps are applied to each skycell and both filters rpi and ipi. 
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Fig. 1. — Skycell layout for the first season. As can be seen the entire disk of the Andromeda galaxy, M 32 and NGC 205 can be 
imaged with one pointing. The underlying M31 image is a single rpi-band frame, which was chosen to have relatively small masked area. 
The masked areas (in black) are e.g ill-functioning chips, video guide star cells and areas on the focal plane with sub-standard imaging 
performance. 



Since we use difference image photometry (jTomanev fc Crofts! 1 1996D . we have to produce a stack of images with the 
best seeing, the so called reference image. We photometrically align the 70 best seeing images, so that the images 
have the same zero point and the same (non zero) sky background. In the next step we repla ce masked are as in the 
photometrically aligned images with values from another image that has the most similar PSF (|Riffeseril200l p. 134). 
The weighted stack of the aligned images is called the referen ce image. 

We use SExtractor to detect sources on the reference image ()Bertin fc Arnouti 119961) . For the data utilized in this 
work, SExtractor is used in a mode that modifies the detection threshold depending on the background. While this 
mode is very useful it hampers completeness tests. 

For all sources in the reference image, we perform PSF photometry on a background (sky and M31) subtracted reference 
image. In order to obtain the background in the reference image, we fit a bicubic spline model. To suppress blending 
effects we iteratively subtract neighboring stars. The fluxes measured in the reference image are added later to the 
fluxes in the difference images, so as to create the light curves. The flux error in the reference image determined by 
the PSF photometry does not take into account how well our constant PSF model matches the real sources in the 



4 



reference frame which can show slight PSF-variation over the field. This can result in wrong flux errors. To obtain 
reasonable errors we rescale the error of each source with the °f its PSF fit. 

To minimize masking and to obtain a better signal-to-noise ratio we stack the images of each visit. The procedure is 
the same as for the reference images. The visit stacks are photometrically aligned to the reference image, so that the 
visit stacks and the reference image have the same zero point and sky background. At this point they only deviate in 
signal-to-noise and PSF-shape. 

To create the difference images we first align the PSF of the reference image to match the PSF of the visit stack by 
calculating a constant convolution kernel by least-squares minimization from all pixels in subareas of 75" x 75" (see 
lAlard k, Luptonlll998l ). This PSF aligned reference image is then subtracted from the visit stack. On this difference 
image we perform PSF photometry to obtain light curves. The PSF is constructed with sub-pixel precision from the 
convolved reference image using isolated stars. Since we only measure the flux difference we add the flux measured in 
the reference image to this flux difference to obtain the light curves for the resolved sources. Fig. [2] summarizes the 
PAndromcda data reduction pipeline. 
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Fig. 2. — Summary of the PAndromeda dat a reduction pipe line (taken from ILee et al.l 120121 (modified)). We use 70 images for the 
reference image stacking, instead of the 30 that ILee et all Q2012T ) use. 

Note that the astrometric precision and photometric stability have been discussed in ILee et~a l. ( 2012). For the flux 
calibra tion we make use of th e fact, that the rpr and ipi bands are rather similar to the Sloan Digital Sky Survey 
SPSS; | Abazaiia n et al.|[2009D rgrjss ancl *sdss bands and thus the flux calibration should be insensitive to color terms 
Tonrv et alJl2012f >. To calibrate our magnitude to the SDSS magnitude system with stars from SDSS we use: 

msoss = -2.5 log [F le{ (PSF rcf ) + AF diff (PSF diff ) + ZP PSF ] (1) 

F re f (PSF re f) is the flux of a source determined by PSF photometry in the reference image and AF^s (PSF^g) the flux 
determined by PSF photometry in the difference image. ZPpsf accounts for the difference between SDSS magnitudes 
and our instrumental magnitudes and does also consider the PSF construction. 

Our pipeline does not take into account the correlated noise that arises from the change of the pixel scale. To quantify 
this we constructed light curves of all pixels, normalize each single deviation from the fitted mean by its error. The 
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distribution of these normalized deviations is clearly dominated by the non- varying pixels. Then the correction factor 
can easily be estimated by comparing this distribution with a standard Gaussian-distribution (sig = 1). This rescaling 
factor is the same for every skycell and filter. 
Throughout this paper, we will use the AB magnitude system. 

3. PERIOD DETERMINATION 

The periodicity of the light curves is determined with the software SigSpec ([Reegenl 12007). SigSpec calculates a 
quantity called spectral significance (sig) for each peak in the discrete Fourier transform (DFT) amplitude spectrum 
of a light curve. Similar to the false alarm probability ®fa(A) 

sig(A):=-log[$ FA (,4)] (2) 

is a measure for the probability that a peak arises from noise. This value does not only depend on the amplitude and 
frequency, but also on the phase and the sampling of the light curve. This f act gives SigSpec an adva ntage over the 
use of other Fourier based algorithms such as the Lomb-Scargle algorithm (|Lomb|[l976l IScargldll982f) . While both 
methods produce comparable results concerning the period, we found that SigSpec finds more periodic light curves 
over a given false alarm probability threshold than the Lomb-Scargle algorithm. This is due to the fact that certain 
periods have to be excluded in the Lomb-Scargle algorithm, depending on the sampling of the light curve in order to 
avoid aliasing. 

In the following paragraph we outline the parameters with which we run SigSpec. As statistical weight we use the 
inverse variance (wi — <J~ 2 ) of the PSF flux measurement and we use [JD — 2450000] for the epoch. We apply a 
sig-threshold of 5 (siglimit 5) to determine the periodicity of the light curves. This means that in less than one out 
of 10 5 cases pure noise could produce the measured period. With a lower threshold the number of detections would 
increase (e.g. by a factor of ~ 10 with a threshold of 3), but most of the detections would be low signal-to- noise 
detections. With the parameter uf req 1 we define the lowest period to be considered as 1 day. We exclude smaller 
periods because they can hardly be detected with a sampling of two visits per night. An upper limit for the period has 
not been defined, but a period cut is performed later on. The last two parameters (multisine : lock and iterations 
1) force SigSpec to consider only the most significant pealQ in the DFT amplitude spectrum ensuring that noise has 
no impact on the determined period. We use no variability index since SigSpec only determines periods for variable 
sources. 

Inspection by eye of the folded light curves, that SigSpec detects, showed the good performance in the period deter- 
mination of SigSpec. One problem, however is the lack of an upper period limit that permits SigSpec to determine 
periods even if the signal is not periodic^. The inspection showed that for large periods some of the folded light curves 
look like Cepheid light curves. We did not exclude those light curves at this stage, but rather applied a period cut 
later on. As our data set also contains 10 days of observations of season 2011, the data show a gap of half a year 
between these data points and the 2010 data points. In some cases the 2011 data points have a small flux shift from 
the 2010 data points in the folded light curve. This may have different reasons. Firstly the shift could be due to flux 
variations. Secondly it could be caused by a small deviation from the true period, which can be observed in the folded 
light curve if the period is small enough. And lastly the frequency spacing used by SigSpec could be too low. Also in 
the case of RV Tauri variables SigSpec detects the wrong period, as it identifies the time span between the primary 
and secondary minimum as the period. We identified and rejected those light curves manuall\T n 1. From 724894 sources 
that were detected in the rpi band and which have light curves in the rpi and «pi baneful. SigSpec finds 75362 periodic 
variable light curves with sig > 5. 

Since SigSpec docs not determine the period error, we use the bootstrap method to estimate the period uncertainties. 
A random sampling is drawn from the light curve (one epoch can be drawn multiple times) and the period for that 
sampling is determined with SigSpec (with a sig-threshold of 1). This procedure is performed 1000 times for each 
light curve, so that the la error can be determined from the resulting distribution. The obtained period errors for the 
Cepheids of the 3-dimensional parameter space classified Cepheid catalog (see Section |4~B1 are shown in Fig. [3] The 
classification of the different Cepheid types shown in Fig. [3J as well as the meaning of the 3-dimensional parameter 
space classified Cepheid catalog, is described in the next section. 

In general the period errors are small, except for some light curves where the distribution has an additional peak at 
~ 1.5 d due to aliasing. For those light curves the large error is only caused by this additional peak. It is not a broader 
distribution that causes the larger error and therefore the period error would be small if there is no aliasing. Since we 
find no aliasing in the final Cepheid light curves and the systematics have a larger effect than the period errors, we 
disregard the period errors, which means that the period errors shown in the plots do not contribute to any selection 
criteria or fit. 



7 The false alarm probability <E>fa( ^) is the integ rated probability density function and describes the probability that an amplitude 
exceeds a given limit A as described in [Rccgcn ( 2007f) . 

8 The SigSpec default value would perform an iterative sequence, the so called prcwhitening sequence. After the most significant peak 
is identified, a fit to all signals that are identified so far is performed. This fit is then prewhitend (subtracted) from the spectrum and the 
next iteration starts until the maximum peak is under the threshold. 

9 SigSpec can determine a period for almost every signal that is not constant. If the signal is not periodic and not constant the program 
usually detects the total observation time as the period. 

10 RV Tauri light curves are rather easy to distinguish from the Cepheid light curves, since the RV Tauri light curves appear as an overlay 
of two shifted light curves. 

11 We detect the variable sources in the rpi band and therefore there is always a light curve in the rpi band for each variable source. 
But the same source can be masked in the ipi band so that there is no light curve in the ip± band. 
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Fig. 3. — Period errors P e determined through the bootstrap method for the 3-dimensional parameter space classified Cepheid catalog. 
The large period errors of P e > 0.5 days are caused by aliasing produced by ~ 3.5% of the bootstrap light curves. 

4. CEPHEID DETECTION AND CLASSIFICATION 

As described in the previous section we narrowed the number of variable sources down to those that have periodic 
light curves. In this sample of variable sources we want to identify the Cepheids. 

The classical approach to find the Cepheids (e.g. lUdalski et al.l Il999al iVilardell et alJ l2007) is to preselect sources 
in a certain brightness and color range and to visually inspect the m. But the position of the instability strip spans 
an area depending on brightness and color (|Fiorentino et al.l |2002| and iMarigo et al.l I20081 including the updates by 
iGirardi et al.|[2010l) and is incorrectly described by a rectangle in the color-magnitude diagram (se e Fig. I4.2I). For 
our color cut we therefore use the instability strip of the Cepheids translated into our filter system (jFiorentino et al.l 
I20021 iMarigo et aT1l2008l IGirardi et al.ll2010h . We choose a different approach than the visual inspection and establish 
an automatic Cepheid detection. The motivation for this is that the selection based purely on light curves is always 
biased, in particular if the light curves become more noisy. We define limits in a 3-dimensional parameter space which 
define our Cepheids. In order to define the parameter space we select a subsample of the periodic light curves and 
visually select the Cepheids. The manual selection is of course also biased, but the advantage of this method is that the 
selection of the remaining Cepheids only depends on the defined parameter space. The parameters used are derived 
from a Fourier decomposition of the light curves. With this method we were also able to identify the Cepheid type. 
The fundamental mo de C epheids and the first overtone Cepheids can be distinguished this way as e.g. shown by 
lUdalski et all (U999ah and IVilardell"et~aT1 (12001 . We can additionally also distinguish the Type II Cepheids from the 
other two mentioned Cepheid types. 

Additionally we impose certain requirements on the remaining variable sources detected by SigSpec, so as to exclude 
bad light curves. 

In the following Sections we describe our approach that we outlined here. 

4.1. Fourier decomposition 

A Fourier decomposition of the fold ed light curves can be used to distinguish be tween fundamental mode (FM) Cepheids 
and first overtone (FO) Cepheids (|Udalski et alJ[r999aL IVilardell et aLll2006t ). We also use Fourier decomposition to 
identify Type II Cepheids and to define the 3-dimensional parameter space in which we detect the Cepheids. 
Wc fit a Fourier series to the light curve of the form 

-1 N 

m(t) = ao + a n ■ cos(nu>t) + a n ■ sin(nu>t) (3) 

n=— JV n=l 

with P being the period of the light curve determined by SigSpec), uj = 2£ and t g [0, P]. Prior to the fit we convert 
the flux to magnitudes and fold the light curves, so that they show the variation of the magnitude over the phase 9 
times the period" 1 ^! P. With this fit we determine the minimum and maximum of the light curve and the coefficients 
a n . We also calculate the magnitude of the mean flux, needed later for the PLR. This is different to a mean magnitude, 
which we would get if we take the mean of the light curve. For this reason we transform the Fourier series fit back to 
flux and calculate the mean flux and then convert this to a mean magnitude. 



- ■ UN Cepheid 
■ FM Cepheid 
=-» FO Cepheid 



- i type II Cepheid 




That is why t £ [0, P] and not t £ [0, 1], what would be the case for folded light curves that vary over the phase. 
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For an analysis of the Fourier decomposition we transform the Fourier series to the form: 

N 

m[t) = bo + bk ■ cos(kujt + tp^) (4) 
fe=l 

with bk — \J a2 -k + a k an d — arctan . This allows us to calculate the amplitude ratio 

A 21 , | (5) 

and the phase difference 

<P2i = ¥2 - 2 • ipi (6) 

for each light curve (jVilardell et al.l [2007T ) . The period P and the coefficients A 2 ± and <f2i are used to define the 
3-dimensional parameter space (P, A21, 1^21) and to identify the Cepheid type. 

We chose N = 5 for the Fourier series fit. Other degrees of freedom (d.o.f.) would result in slightly different amplitudes 
a n . In most cases the deviations of the a n from other d.o.f., compared to the a n of N = 5, is small, but there are also 
light curves with significant deviations for higher d.o.f.. While N — 5 is sufficient to fit bumps (c.f. left panel Fig. 
IT2|) in the light curves, higher d.o.f. are more susceptible to 'overshoots' and tend to be m ore influenced by ou tliers. 
This is why we do not determine the optimal d.o.f. for each ligh t curve as e.g. descri bed in lDeb fc Singhl ([2009). The 
principal component analysis (PCA) technique demonstrated in lDeb fc Singhl (|2009( ) does not offer any advantage in 
the form of saving computation time, since we use a linear least squares fit and not a non linear fit for which the PCA 
is faster. 

Additionally we derive a polynomial fit with two boundary conditions of the form: 

K 

p{t) =a + Y, a n t n with p(0) = p(P) A p'(0) =p'{P) Me [0, P] (7) 

n=l 

to the folded light curves (in magnitudes). For this fit a change in the number of d.o.f. also changes the a n , so we 
chose K = 10 in order to have the same number of d.o.f. as in the Fourier series fit. With this polynomial fit we have 
an alternative light curve description to the Fourier series. The best fitting Fourier and polynomial fits arc affected by 
data gaps in a different way and they produce different 'overshoots'. We compare the Fourier and polynomial fits to 
identify and exclude badly fitted light curves, as we will describe in the next section. 



4.2. Wesenheit-color cuts 

Young stars are located in spiral arms which are also populated by interstellar dust. The dust in the arms show a 
clumpy distribution and Cepheids may be either located before, within, or behind this dust. 

Although Population II Cepheids are not located in the M31 disk it is not a priori known whether they stay in front 
of or behind the disk. Therefore, even for the majority of the Population II objects an understanding of the internal 
reddening is required for each single object. 

As already mentioned the preselection of sources in a certain brightness and color range depends on the instability 
strip. But it also depends on the reddening. By using the Wesenheit magnitude the brightness is independent of 
reddening and only the color is affected by extinction. For this reason we use the Wesenheit-color plane and select 

Cepheids according to the location of the instability strip. 

The Wesenheit magnitude is defined to be independent of reddening (|Madordll976l lMadordll982l lOpolskilll983| ) 

W = rpi - R(r P i - ipi) = r Plfi - R(r P i i0 - i P i, ). (8) 

With rpi = rpi.o + A rrl , ipi = ipi.o + Ai pl and A rpl and Ai pl being the extinction corrections in the rpi and ipi 
bands we can derive the coefficient: 

rpi,o + A rpl - R(rpi fi - ipifl) - R(A rrl - A lpl ) = r P i !0 - R{rpi,o - ipi,o) 



^R = 



(9) 



The galactic extinction coefficients gni-yl an< ^ e(b"-v) are 

(|Tonrv et al.ll20"Tl : 

-^V) = 2 - 585 - °- 0315 • (5Pi,o - i P1 ,o) 
sfe-v) = L908 - °- 0152 ' (ffPi,o ~ *pi,o) 

iFiorentino et al.l ()2002|) provides the position of the instability strip for Cepheids depending on and T e g (L is the 
solar luminosity and T c g the effective Temperature). We combine this with the isocrones from TMarigo etaLl (2008) 
with the corrections bv lGirardi et al.l (|2010l) with Z = 0.019, for different and T e g, thus allowing us to determine 
the theoretical location of the Cepheids in the color-magnitude diagram, as shown in the left panel of Fig. 14.21 The 




rpi,o-ipi,o rpi,o-ipi.o 

Fig . 4. — Domain of the instability strip for Cepheids used for the Wesenheit-color cut. A distance modulus of 24.36 {Vilar dell et alj 
2010) has been applied to the magnitudes. Left panel: Location of the instability strip for Cepheids in the magnitude-color plane according 
to instability theory. Right panel: Location of the instability strip for Cepheids in the Wesenheit-color plane according to instability 
theory. The area inside the dashdotted lines encloses the instability strip and the area within the dashed lines is where we require the 
mean magnitude of the light curve to be in, within their error margin. To define this domain we enlarged the boundaries of the dashdotted 
region by 0.2 mag in both color directions and by 0.2 mag in direction of brighter Wesenheit magnitudes. 



right panel of Fig. 14. 2 1 shows the domain defined by this theoretical prediction in the color- Wesenheit diagram (defined 
by a triangle with the corners (0.036/21.50), (0.275/16.56) and (0.59/16.56)). A candidate is qualified as a Cepheid 
variable star if its mean magnitude is located inside this triangle (including its la error). Similar to Fig. 14.21 we also 
obtained the location of the Cepheids in the gpifl vs. gpi.o — ipi.o plane, see Fig. [SJ For (<7pi ; o — ipi.o) — 1-0 and 

from Eq.[TU]we get R f^ F J V ) — 2.5535, E f^ v) = 1-8928 and R = 3.86. Since the (gpi.o — ipi.o) correction is small, the 

«pi o) = 1-0 a reasonable choice. 
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gpi,o - ipi.o 

Fig. 5. — Domain of the instability strip for Cepheids used to determine the (ffpi o — *Pl,o) correction in the galactic extinction coefficients. 
Since the correction is small, (gpi.o — *P1 o) = 1-0 is a reasonable choice. 



While the Wesenheit magnitude is independent of extinction, the color is not. Therefore we apply an extinction correc- 
tio n to the colors. We us e the E(B — V) map from Mo ntalto et al.1 (|2009D and the foreground ext inction of M31 derived 
by iSchlegel et al.1 (1998) with E{ S (B — V) = 0.062 and apply the recalibration factor of 0.86 (jSchlafiv fc Finkbeinerl 



1201 If) , to determine a mean extinction for each light curve depending on its (projected) position: 

- — _ E(B-V) + 0.86 -0.062 A rpi 

2 ' E(B - V) { ' 

E(B - V) + 0.86 • 0.062 A ipi 



Alpl ~ 2 ' E(B-V) (12) 

Note that for Cepheids that are outside the Montalto et al. map we only corrected for the foreground extinction. 
Obviously we could use an extinction corrected magnitude and use it instead of the Wesenheit, but since the correction 
is associated with a large systematic uncertainty it is better to use the Wesenheit, which does not require an extinction 
correction. The Wesenheit thus has a higher statistical error, but a lower systematic error relative to the extinction 
corrected magnitudes. 

With the Wesenheit and the extinction corrected (rp^o — ipi.o) color we are able to select only those light curves 
belonging to the instability strip. This Wesenheit-color, applied to the 75362 periodic light curves detected by SigSpec, 
excludes 70% of the light curves. 



4.3. Manual classification 

In order to find out in which part of the P, A21, ^21 3-dimensional parameter space the Cepheids are located, we need 
a 'training' set of Cepheids which we obtain from a manual classification, i.e. by visual inspection of the preselected 
light curves. 

The visual inspection of a subsample of Cepheid candidates and admission or rejection of Cepheids based on the light 
curve can lead to a bias if the Cepheid subsample itself does not represent all Cepheid types. The most unbiased way 
to select a Cepheid candidate subsample for manual classification is to use the spectral significance (sig). On the one 
hand the significance threshold has to be chosen low enough to contain a fair amount of Cepheids, so that there are 
enough Cepheids for the exploration of the 3 dimensional parameter space. On the other hand, we do not want the 
Cepheid candidate subsample to contain so many Cepheids already, that the classification with the parameter space 
does not detect significantly more Cepheids than the subsample. 

For the manual classification we thus selected all light curves that have s ig > 25 in the rpiband. A s reference light 
curves for the manual classification we used the Cepheid light curves from ISterken fc Jaschekl (|2005| ) . 
The resulting manual classified subsample contains 1020 Cepheids. This subsample makes up ~ 50% of the final 
Cepheid sample. 

4.4. Type classification 

As already mentioned in Section |4~T1 the Fourier decomposition can be used to distinguish between FM Cepheids and 
FO Cepheids. This is done by identifyin g different locations of the Cepheids in the A21-P and (f2i~P planes (e.g. 
lUdalski et al.lll999allVilardell et al.ll2007| ). Fig. |6] shows these projections for the manually classified Cepheid catalog. 
When looking at the Cepheids in a 3-dimensional plot in A21-VJ21-P space, it becomes clear that there are indeed three 
different relatively well separated distributions with only small overlap areas. 

The FM Cepheids correspond to the 'V-shaped' sequence that can be seen in in the left panel of Fig. [51 In a 3- 
dimensional plot the FM sequence appears spiral shaped and the two branches are shifted slightly from one another. 
The FO Cepheid sequence is perpendicular to the FM sequence, but there is also an overlap regime between them 
where FM and FO objects can be hardly distinguished. For objects in or near this overlap area the classification into 
FM or FO is therefore rather uncertain as a real p arameter overlap seem s to be present. 

We additionally find a third sequence not present in Udalski et al.l (|1999al ) and lVilardell et a l. (2007) and corresponding 
to Type II Cepheids. This sequence is more distinct from the FM Cepheids than the FO Cepheids, nevertheless there 
is also a transition between the two. 

In a first step we define the parameter space where most of the FO Cepheids and the Type II Cepheids are located 
and assign the according type to the Cepheid. We use 

P < 6.0 d and A 21 < -0.498 log(P) + 0.528 (13) 

for the FO parameter space. 
For the Type II space we use 

11.95 d < P < 53.0 d and p 2 i > 5 rad. (14) 

The remaining Cepheids are FM Cepheids. Fig. [5] shows the three different sequences. Nevertheless there are 
transitions between the sequences. There are FM Cepheids in the FO and Type II parameter space and FO Cepheids 
or rather Type II Cepheids in the FM sequence. In order to account for these ambiguous cases we identify these 
Cepheids in a second step by using the Period - Wesenheit relation. By using the Wesenheit and not the rpi band we 
get a classification independent of reddening. We perform iterative 3cr clipping for a linear relation (W = a-\og(P) + b, 
c.f. Section [^41 for each Cepheid type and classify all clipped Cepheids and all those Cepheids with a large error in 
the Wesenheit (AW > 0.9) as unclassified (UN) Cepheids. Fig. [7] shows that this classification works well, but also 
indicates the problem with this approach. Cepheids that are e.g. outside the defined Type II parameter space will be 
classified as UN Cepheids although it is apparent from Fig. [7] that they are most likely Type II Cepheids. 




P[d] P[d] 

Fig. 6. — 3-dimensional parameter space for the manually classified Cepheid catalog in the rpi band. The scope defined by the magenta 
dashed lines defines the parameter space in which most of the FO Cepheids (P < 6.0 d and A21 < —0.498289 * log(P) + 0.527744) or rather 
most of the Type II Cepheids (11.95 d < P < 53.0 d and if2i > 5rad) reside in, while the FM Cepheids occupy the remaining parameter 
space. Since there are transitions between the different scopes, we perform a 3<r clipping in the Period - Wesenheit relation (cf. Fig. 
for each Cepheid type. All the clipped Cepheids and all those Cepheids with a large error in the Wesenheit (A14 7 > 0.9) are classified as 
unclassified (UN) Cepheids. Left panel: Amplitude ratio A21 in the rpiband. Right panel: Phase difference </J2i in the rpiband. 




P[d] 

Fig. 7. — Period - Wesenheit relation for the manually classified Cepheid catalog. The Period - Wesenheit relation is used to identify 
those Cepheids that reside bet ween the type classification scopes (cf. Fig. [jjj. This is done by performing 3<r clipping for a linear relation 
(W = a-log(P) + b, cf. Section l6.4H for each Cepheid type (FM Cepheids: blue symbols and line; FO Cepheids: red symbols and line; Type 
II Cepheids: green symbols and line). All the clipped Cepheids and all those Cepheids with a large error in the Wesenheit (A14 7 > 0.9) are 
classified as unclassified (UN) Cepheids. 



4.5. Selection criteria 

With a visual inspection of the light curves only a color cut is needed to identify the Cepheids. Since we want to 
identify Cepheids using the P, A21, ^21 3-dimensional parameter space cut we have to apply additional selection 
criteria to exclude bad light curves. Note that the following criteria can be applied in any order. Table Q] summarizes 
the selection cuts. 

The first two criteria (I & II) concern the period of the light curve. As already mentioned in Section [3] we have 
not applied a period cut yet. Our data covers approximately 150 days in 2010 and a few days only in 2011. While 
inspecting the light curves by eye, these few days in 2011 can help to confirm that the detected period is valid, even 
if the period is ~ 100 d. However since there is a gap of approximately half a year between the 2010 and 2011 data 
with only few days of observations in 2011, the light curve could be irregular in the time span of the gap and the 
data points in 2011 could fit to the folded light curve by chance. We decided therefore to select only light curves with 
periods below 75 days (I) , so that the whole period cycle is sampled at least twice in the 2010 data. We note that 
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there could very well be Cepheids with larger periods than that, but we will be able to confirm those only after the 
release of the full 2011 data. The second criterion (II) is that the period determined in the zpi filter differs from that 
in the rpi filter by less than one percent. This criterion excludes most of the light curves (84 %). 
We do not require a light curve to contain more than a certain number of epochs. The reason for this is that we use 
the Fourier series fit and the polynomial fit to account for gaps in the data that might be problematic and produce 
'overshoots'. Most of our light curves have ~ 180 epochs, but there can also be light curves with as few as 60 epochal. 
The left panel of Fig. [5]shows the distribution of the number of epochs for the 3-dimensional parameter space classified 
Cepheid catalog. The right panel of Fig. [8] shows that there are only a few Cepheids with a small number of epochs 
when normalized to the period of the light curve in the 3-dimensional parameter space classified Cepheid catalog. A 
visual inspection shows that these light curves constitute Cepheids. This indicates that we need no criterion requiring 
a certain number of epochs when applying the three 'overshoot' criteria (The classification of the different Cepheid 
types shown in Fig. [8] is described in the next Sections). 

The criteria III & IV exclude extreme 'overshoots' by selecting only those light curves, where the magnitudes m(t) of 
the fitted light curve are reasonable: m(t) < 30 mag and m(t) > 15 mag. 

Note that criteria I to IV are applied to both rpi and ipi band. All other criteria only concern the rpi band. 
In order to also exclude less extreme 'overshoots', we apply criterion V, which selects only light curves with similar 
Fourier series fit and polynomial fit. We check for similarity by calculating the area between both fits and demand that 
p 

the area / (m(t) — p(t)) 2 dt is below a threshold. The median of this area is ~ 0.2 mag • d and the largest area is ~ 13 
o 

for the 3-dimensional parameter space classified Cepheid catalog. So our threshold of 30 is quite high when compared 
to those numbers, but the 'overshoots' usually produce orders of magnitude larger values than this threshold. The 
choice of the threshold is arbitrary to some degree, although the threshold of 30 has proven to work well. 
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Fig. 8. — Epoch distribution for the 3-dimensional parameter space classified Cepheid catalog in the rpi band. Left panel: Distribution 
of the number of epochs. Right panel: Distribution of the number of epochs divided by the period. 



The last two selection criteria (VI & VII) concern the noise in the folded light curves. If the maximum and minimum 
of the Fourier series fit are similar, we cannot exclude that the light curve is constant and the variation is caused by 
noise. Additionally light curves affected by blending have smaller amplitudes. Therefore we select only those light 
curves with: max(m(<)) — min(m(t)) > 0.1 mag. The \ 2 of a fit can be low, although the scatter is large, if the errors 
in the data points are large enough. Therefore we introduce a last selection criterion (VII), which concerns the scatter 
of the light curve. We want to allow for a few points to scatter, that is why we compare the median scatter of the 
Fourier series fit with the total change in magnitude: median [\m(t) — y(t)\] < 0.2 (max(m(£)) — min(m(i))) {m(t) is 
the magnitude of the Fourier series fit at time t and y(t) is the measured magnitude at time t). The threshold of 0.2 
was optimized by a visual inspection. 

4.6. 3-dimensional parameter space classification 

After the selection criteria are applied (c.f. Section |4~5|) we are left with some objects with well defined light curves 
that are not Cepheids. This becomes obvious if we compare their location in the 3-dimensional parameter space (A21- 
<^2i-P) with that of our manually classified Cepheids. Therefore we use the manually classified subsamplc to define 

13 60 is the lowest number of epochs for a light curve associated to a Cepheid in the 3-dimensional parameter space classified Cepheid 
catalog. 
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TABLE 1 

Selection criteria for the light curves. P is the period, m(t) the magnitude of the Fourier series fit at time t, p(t) the 

POLYNOMIAL FIT AT TIME t AND y(t) IS THE MEASURED MAGNITUDE AT TIME t. THE LAST COLUMN IS THE PERCENTAGE EXCLUDED BY THE 
CRITERION (IF EACH CRITERION IS APPLIED INDIVIDUALLY) FROM THE 75362 PERIODIC LIGHT CURVES DETECTED BY SlGSPEC. 





Selection criterion 


percentage excluded 


I 


P rpl < 75 d 


66% 


II 


K. — P 
rp i lpl | < 0.01 
r pi 


84% 


III 


m(t) < 30 mag 


19% 


IV 


m(t) > 15 mag 


20% 


V 


p 

/(m(t) -p(t)) 2 dt < 30 



31% 


VI 


max(m(t)) — min(m(f)) > O.lmag 


4% 


VII 


median(\m(t)-y(t)\) ^ ° mac 
max{m{t))—min{m{t)) — ' ^ 


42% 



a '3-dimensional parameter space' occupied by Cepheids. All variables surviving the selection criteria in Section 14.51 
which at the same time reside in the 3-dimensional Cepheid subspace are considered Cepheids as well. The complexity 
of this 3d subspace is so high that there are no 2d projections that could classify Cepheids as effectively. 
We span a 10 x 10 x 10 gricf3 in A 2 i-<p 2 i-P space with 1 < log(P) < 75, < A 2 i < 0.75 and < ip 2 i < 2tt. We then 
identify those boxes around the grid-points where the Cepheids of the manually classified subsample reside in. The 
space defined by those grid-points that have Cepheids around them, defines the 3-dimensional parameter space. The 
subsample size and the rather large grid span force us to accept grid-points that only have one data point around it. 
While this is not ideal, the alternative would be to accept only grid-points that have a number of Cepheids above a 
certain threshold and this would drastically decrease the number of Cepheids at large periods, Type II Cepheids and 
FO Cepheids. 

After the Wescnhcit-color cut and the selection criteria are applied there are 2030 light curves within this parameter 
space, including also those already identified with the manual classification. As stated in Section 02 these light 
curves can contain RV Tauri light curves. We inspect those 2030 light curves manually and find 21 RV Tauri light 
curves, which we exclude from the 3-dimensional parameter space classified Cepheid catalog. The final 3-dimensional 
parameter space classified Cepheid catalog contains 2009 Cepheids (including the 1020 manually classified Cepheids). 




1.0 10.0 100.0 1.0 10.0 100.0 

P[d] P[d] 

Fig. 9. — 3-dimensional parameter space for the P, A21, <fi2i 3 dimensional parameter space classified Cepheid catalog in the rpi band. 
The scopes for the type classification are the same as in Fig. [6] The transitions between the different sequences become more pronounced 
when compared to those in the manually classified Cepheid catalog, while the shape of the sequences is preserved (c.f. Fig. [B). Left 
panel: Amplitude ratio A21 for the 3-dimensional parameter space classified Cepheid catalog. Right panel: Phase difference (£21 for the 3 
dimensional parameter space classification. 

As can be seen in Fig. [5] the transition region between the different Cepheid types becomes more populated, in 



A grid with more grid-points could leave gaps, which we want to avoid. 
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TABLE 2 

Excerpt of the 3-dimensional parameter space classified Cepheid catalog. Only a portion of the complete table is shown 
here. The table in its entirety is available in machine-readable format in the online journal. The columns contain (from left 
to right): identifier, Cepheid ty-pe, RA (J2000.0), DEC (J2000.0), significance (sig), period (determined from the rpi band light 

curve), mean magnitude in the rpi band, mean magnitude in the ipi band, Fourier coefficient A21, Fourier coefficient 1^21, 

Wesenheit index and the decline/rise factor (c.f. Fig. I2TI1 



identifier 


type 


RA 


DEC 


sig 


1' 




rpi 








ipi 








A21 






¥21 






W 






decl./rise 


103-27590 


FM 


11.1608 


42 


1585 


16.40 


4.398 


21 


34 


± 





08 


21 


20 


± 





08 





37 


± 





03 


4 


37 


± 





11 


20 


80 


± 





38 


2.80 


040-12157 


FM 


9.8918 


40 


4165 


16.09 


6.304 


21 


10 


± 





05 


20 


92 


± 





05 





11 


± 





03 


1 


66 


± 





10 


20 


40 


± 





25 


2.24 


102-03621 


FM 


11.0967 


11 


9573 


19.07 


10.224 


20 


01 


± 





02 


19 


80 


± 





05 





19 


± 





01 


4 


87 


± 





07 


19 


09 


± 





18 


2.14 


103-12114 


FM 


11.3482 


12 


0301 


18.53 


10.370 


20 


73 


± 





10 


20 


10 


± 





10 





13 


± 





02 


4 


32 


± 





15 


19 


47 


± 





18 


1.15 


064-30826 


FM 


10.1540 


41 


2511 


32.34 


16.563 


19 


84 


± 





03 


19 


63 


± 





03 





32 


± 





01 


1 


78 


± 





02 


19 


02 


± 





14 


1.94 


053-14189 


FM 


10.5470 


10 


8116 


28.07 


27.591 


19 


46 


± 





02 


19 


10 


± 





02 





11 


± 





01 


1 


80 


± 





02 


18 


06 


± 





11 


4.38 


065-11258 


FM 


10.2662 


11 


1512 


33.25 


45.740 


18 


78 


± 





02 


18 


18 


± 





04 





13 


± 





00 


4 


89 


± 





01 


17 


61 


± 





15 


3.62 


090-30644 


FM 


11.0918 


41 


9226 


18.76 


66.429 


18 


55 


± 





02 


18 


20 


± 





03 





19 


± 





01 


1 


91 


± 





01 


17 


21 


± 





12 


1.82 


051-15274 


FO 


9.7241 


40 


7097 


33.30 


2.730 


21 


39 


± 





08 


21 


33 


± 





08 





08 


± 





04 


4 


85 


± 





41 


21 


17 


± 





39 


1.14 


090-07109 


FO 


10.6804 


41 


6543 


30.35 


3.683 


21 


23 


± 





03 


20 


91 


± 





05 





13 


± 





01 


4 


35 


± 





31 


20 


02 


± 





20 


1.60 


041-31081 


T2 


10.5598 


40 


5848 


29.91 


21.514 


21 


47 


± 





03 


21 


23 


± 





01 





33 


± 





03 


5 


10 


± 





09 


20 


56 


± 





20 


3.19 


078-12906 


T2 


10.9476 


41 


4886 


27.59 


33.832 


20 


92 


± 





05 


20 


79 


± 





07 





11 


± 





03 


5 


56 


± 





07 


20 


11 


± 





33 


2.01 


103-01636 


UN 


11.4924 


41 


9421 


16.31 


5.251 


20 


39 


± 





03 


20 


08 


± 





05 





37 


± 





03 


1 


14 


± 





09 


19 


22 


± 





19 


3.38 


077-16136 


UN 


10.5815 


41 


4385 


28.22 


26.096 


20 


17 


± 





21 


20 


08 


± 





21 





12 


± 





01 


1 


81 


± 





01 


18 


98 


± 


1 


02 


4.68 



comparison to the manual classified Cepheid catalog (c.f. Fig. [5]). This makes the classification of the UN Cepheids 
with the Period - Wesenheit relation (Fig. ITOl) even more important. Of the 75362 periodic light curves detected by 
SigSpec the 3 dimensional parameter space cut alone excludes 94%. 




1.0 10.0 100.0 

P[d] 

Fig. 10. — Same as Fig. [7] but for the 3-dimensional parameter space classified Cepheid catalog. 

It is important to note that while this procedure works well, there are Cepheids that are excluded by this strict 3- 
dimensional parameter space classification cut. This is due to the grid size and the sampling of the parameter space 
through the manually classified subsample. 

5. THE CATALOG 

The 3-dimensional parameter space classified Cepheid catalog contains a total of 2009 Cepheids, consisting of 1440 
FM Cepheids, 126 FO Cepheids, 147 Type II Cepheids and 296 UN Cepheids. 

Table [5] contains an excerpt of the 3-dimensional parameter space classified Cepheid catalog (the corresponding light 
curves are shown in Fig. [fU Fig. Q21 Fig. [13] Fig. HH Fig. HH Fig. [16] and Fig. [17]). Table [2] is available in its entirety 
in machine-readable format in the online journal. A portion is shown here for guidance regarding its form and content. 



14 



JD - 2450000 

5400 5450 5500 5550 5600 5650 5700 5750 5800 



20.6 

20.8 

21.0 

21.2 

"21.4 

21.6 

21.8 

22.0 
20.6 

20.8 

21.0 

21.2 

"21.4 

21.6 

21.8 

22.0 



I 111 * 



t z 




-1 







1 



2 

P-0 



JD - 2450000 

5400 5450 5500 5550 5600 5650 5700 5750 5800 



20.4 
20.6 
20.8 
-21.0 
21.2 
21.4 
21.6 
21.8 

20.4 
20.6 
20.8 

_21.0 

& 

21.2 
21.4 
21.6 
21.8 



t*+ ++ Hi 



y * 



i,li hi 








p-0 



Fig. 11. — Light curves in the rpi band for a part of the excerpt of the 3-dimensional parameter space classified Cepheid catalog. Each 
top panel shows the unfolded light curve, while each bottom panel shows the variation of the folded light curve over the period times the 
phase 0. The folded light curve has been continued in both directions for better illustration, only the middle part between the dashed lines 
shows the real folded light curve. The grey circle data points represent the 2010 season, while the gray triangle data points are within the 
2011 season. The Fourier series fit is the black soli d line and the black dashed line represents the light curve that is only defined by A21, 
ip2l and the mean magnitude rpi (c.f. Section k.lj l, Left panel: Light curve of a FM Cepheid with P = 4.398 d. Right panel: Light curve 
of a FM Cepheid with P = 6.304 d. 



JD - 2450000 JD - 2450000 

5400 5450 5500 5550 5600 5650 5700 5750 5800 5400 5450 5500 5550 5600 5650 5700 5750 5800 




Fig. 12. — Same as Fig. Illl for the next two entries in the excerpt of the 3-dimensional parameter space classified Cepheid catalog. Left 
panel: Light curve of a FM Cepheid with P = 10.224 d. Right panel: Light curve of a FM Cepheid with P = 10.370 d. 
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Fig. 13. — Same as Fig. Illl for the next two entries in the excerpt of the 3-dimensional parameter space classified Cepheid catalog. Left 
panel: Light curve of a FM Cepheid with P = 16.563 d. Right panel: Light curve of a FM Cepheid with P = 27.591 d. 
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Fig. 14. — Same as Fig. Illl for the next two entries in the excerpt of the 3-dimensional parameter space classified Cepheid catalog. Left 
panel: Light curve of a FM Cepheid with P = 45.739 d. Right panel: Light curve of a FM Cepheid with P = 66.429 d. 
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Fig. 15. — Same as Fig. Illl for the next two entries in the excerpt of the 3-dimensional parameter space classified Cepheid catalog. Left 
panel: Light curve of a FO Cepheid with P = 2.730 d. Right panel: Light curve of a FO Cepheid with P = 3.683 d. 
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Fig. 16. — Same as Fig. Illl for the next two entries in the excerpt of the 3-dimensional parameter space classified Cepheid catalog. Left 
panel: Light curve of a Type II Cepheid with P = 21.513 d. Right panel: Light curve of a Type II Cepheid with P = 33.831 d. 
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Fig. 17. — Same as Fig. Illl for the next two entries in the excerpt of the 3-dimensional parameter space classified Cepheid catalog. Left 
panel: Light curve of a UN Cepheid with P = 5.251 d. Right panel: Light curve of a UN Cepheid with P = 26.095 d. 



We match the position of the Cepheids in our sample with the 416 Cepheids from I Vilardell et al.l (|2007| ) and found a 
coincidence for 225 Cepheids (187 FM, 9 FO, 2 Type II and 27 UN Cepheids) with a matching radius of 1 arcsec. The 
periods of both samples show a good agreement (Fig. [T8|) . The light curve of the Cepheid with the largest difference 
in the period (5.101 days) is shown in the right panel of Fig. 1161 The reference image shows no other close source 
and as can be seen from the right panel of Fig. [TTJ] the period determination is reasonable, which suggests a slightly 
wrong period in the Vilardell sample. The periods for the other 5 light curves with a relative difference larger than 
0.5% we were able to confirm by visual inspection, but in some cases we can not rule out that there is another close 
source within the 1 arcsec matching radius and that the difference is caused by that sour ce. 

We also matched our Cepheids with the Cepheids of the DIRECT project (Macri 2004) and the WeCAPP project 
(jFliri et al.l 120061 ) with the matching radius of 1 arcsec. We found matches for 216 Cepheids from 332 DIRECT 
Cepheids (187 FM, 3 FO, Type II and 26 UN Cepheids) an d 26 matched C e pheid s from 126 WeCAPP Cepheids (15 
FM, FO, 4 Type II and 7 UN Cepheids). Taking also the iVilardell et al.l (|2007t ) sample into account we matched 
354 unique Cepheids. 




P[d] 

Fig. 18. — Relative difference of our periods with the 225 (187 FM (cir cles), 9 FO (upwards pointing triangles), 2 Type II (downward 
pointing triangles) and 27 UN Cepheids (squares)) matched Cepheids from Vilardell et al. (2007). The light curve of the Cepheid with the 



largest difference in the period is shown in the right panel of Fig. 1161 The cascaded form of the relative difference is due to the precision 
of the compared values. 
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Fig. 1191 shows the color magnitude diagram and the color Wesenheit diagram for all resolved sources in the rpi band 
and the Cepheids of the 3-dimensional parameter space classified Cepheid catalog. Left panel of Fig. [20] shows the 
color Wesenheit diagram for the Cepheids alone. We require that the Cepheids lie within the domain enclosed by the 
dashed line which results from theoretical predictions of the instability strip (see Section |4~2|) . 

Using a simple color cut for the selection of Cepheids would either yield too many interlopers (wide cut) or cut too 
many Cepheids (narrow cut) as a simple color cut can not account for the tilt of the instability strip in the CMDs 
(c.f. right panel of |2"D"1) . The use of the instability strip allows for a strict sample cut and yields a color distribution 
dominated by Cepheid stars (c.f. left panel of Fig. [20)) . 




rpi - ipi rpi - ipi 

Fig. 19. — Color Wesenheit Diagram and Color Magnitude Diagram for all the 724894 point sources that were detected in the rpi band. 
Cepheid variable stars are color coded. Left panel: Color Magnitude Diagram. Right panel: Color Wesenheit Diagram. 
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Fig. 20. — Left panel: Color Wesen heit Diagram with the domain (black dashed lines) we require the data points to be in, consistent 
with their margin of error (c.f. Section l4.2l l. Right panel: Histogram of the color distribution. 

To describe the overall shape of the Cepheid light curve and to trace the bump progression (also known as Hertzsprung 
progression. iBono et al.ll2000D we introduce the decline/rise factor. This factor describes the ratio of the time from the 
minimal to the maximal magnitude (not flux!) compared to the time from the maximal to the minimal magnitude. 
Fig. [2T1 shows that there are different scopes that correspond to the different positions of the bump. Around a period 
of 10 days two bumps occur on each side of the minimal magnitude and the light curve gets quite symmetrical. This 
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can be verified in our sample light curves in Fig. 1121 
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Fig. 21. — Decline/rise factor for the 3-dimensional parameter space classified Cepheid catalog. This factor describes the ratio of the 
time from the minimal to the maximal magnitude (not flux!) compared to the time from the maximal to the minimal magnitude and it is 
a good indicator of the overall shape of the light cu rve. We obse rve diffe rent scop es th at c orre spon d to the pos ition of the bump. The 14 
Cepheids framed by a black circle are shown in Fig. [TT] Fig. 1121 Fig. [T3] Fig. [TJ] Fig. [Tg] Fig. [T5]and Fig. \T7\ 

6. RESULTS 

So far, the most complete catalog of Cepheid variable stars in M31 had been presented by iVilardell et ail (|2007j) . 
They found 416 Cepheids of which they classified 281 as FM and 75 as FO objects. We here present a dramatically 
larger catalog with 2009 candidates, almost 5 times the n umber of object s in th e earlier catalog. The factor ~ 4.8 
difference has to be attributed to the different patrol fields: IVilardell et al.l (|2007D observed 0.32 square degree within 
north-eastern lobe of the disk of M31 (and some contribution from bulge and halo) while our Pan-STARRS 1 data 
cover the full disk, the complete bulge and a large fraction of the M31 halo (only a small part of the halo has been 
analyzed for this work). 

At the distance of M31 (750 to 770 kpc), an arcsecond corresponds to ~3.7 pc. Therefore, ground-based imaging of 
individual stars within M31 is subject to blending and crowding, especially for young stars like Type I Cepheids which 
are often located in young clusters or associations. Eve n at the resolution of t he HST came ras, blending i s an is sue. 
The i mpact of blen d ing h as been discussed in detail by IVilardell et al.l (|2007t ) for M31 and iChavez et al.l (|2012D for 
M33. IChavez et all (I2012T) - comparing ground based images to HST images - find severe effects (> 10 % level) for 
more than 30 % of the stars in V and /. 

6.1. FO-to-FM ratio 

Interestingly, the relative amount of FO stars as compared to the FM pulsators differs between Vilardell et al. and 
our findings, namely, 0.27 vs. 0.09. Both values have their largest uncertainties in the number of unclassified objects 
as it is rather uncertain to which of the two pulsational modes these objects belong. Incomplete detection can further 
add uncertainties. As the Vilardell et al. data were obtained with a slightly larger telescope, and with B, V filter 
bands, they might have a better completeness for the on average bluer and fainter FO stars. Therefore, it is eve n more 
puzzling that the FO-to-FM number ratio based on the David Du nlap Observatory S ample (|Fernie et al.lll995l) of the 
Milky Way (~ 0.06) and of the General Calatog of Variable Stars (jSamus et alj|2012l) within the Milky Way (~ 0.13) 
matches nicely our ratio. 

We note that the FO-to-FM ratios as derived from the OGLE surveys of the l ower metallicity stars forming dwarf 
galaxies LMC, SMC, and IC 1613 yield ratios in the range 0.4 to 0.7 (see e.g. lUdalski et al.lll999bl and references 
therein, and the OGLE project web pages for the catalogs). Comparing these values to the one of the two spiral 
galaxies might be indicative for a metallicity trend, despite the rather large scatter of the data in this possible relation. 
FO Cepheids preferentially popu late the blue boundary of the instability strip. As isochrones easily demonstrate (see 
e.g. Fig. 1 in Becker ct al.lfl977l ). lower metallicity objects have blue loops in post main sequence evolutions which 
extend far to the blue and cross the complete strip while higher metalicity shrinks the extent more to the red and the 
strip is no more fully crossed. Therefore, a trend between host metallicity and FO-to-FM ratio is not surprising. 

6.2. Amplitude ratio 

It is well known that the amplitude of light variations of Cepheids decreases from blue to red wavebands. In several 
recent investigations of extragalactic samples, the ratio between the amplitudes in two bands has been used to select 
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Cepheid candidates. We do not use the amplitude ratio for selecting good Cepheid candidates, but derive a ratio 
between the rpi and ipi of ~ 1.3 with a slight dependence on the period (see Fig. [22] for our M31 sample). This is in 
excelle nt agreement with the high quality photometry of galactic Cepheids (FM and FO) by Wisnic wski fc Johnson! 
(1968). From their light curves we find an average ratio in R and / of 1.28 ± 0.09, and their data points are very well 
represented by our fit to the M31 sample data (including the slope). 




P[d] 

Fig. 22. — Amplitude ratio for the 3-dimensional parameter space classified Cepheid catalog. We observe a slight slope (A rpl /Ai rl = 
a + b ■ log(P)) with a = 1.309 ± 0.006, b = -0.0478 ± 0.0014 and a dispersion of 0.1680. 

6.3. Period distribution 

The FM period distribution of the presented catalog (Fig. |2"3"| shows a double peaked-distribution with peaks at ~ 0.75 
and 1.1 in log(P) [~ 5.5 days and ~ 12.5 days), respectively. The peaks coincide with the ones found previously by 
IVilardell et al.l ([2007D . Also, the shape of the distribution agrees quite well with the exception that our sample shows 
a steeper cut-off to the shortest periods which probably is related to the somewhat shallower PS1 data. Thus, our 
larger data set fully supports the more general finding in the Milky Way and M31 that objects of solar metallicity show 
a complex per iod distribution as c ompared to e.g. the Mag ellanic Clouds and other low-metal star forming galaxies 
(|Becker et al.lll977l IVilardell et aL1[2007l lAlcock et al1[l999l . 

We note that the period distributions of M 101 and NGC 4258, two other large spirals with more than 800 and 
300 detected Cepheids, r espectively, show similarly double peaked distribution with a local minimum at ~ 10 d 
(jShappee fc Stanekll2011l . [Macri 2004)). Thus, this shape of the distribution seems to be a rather common feature of 
the Cepheid population of large, solar metallicity spiral galaxies. A full discussion of this topic is beyond the scope of 
this paper. 

6.4. PL relation 

The Cepheids of the 3-dimensional parameter space classified Cepheid catalog are used to determine the Period- 
apparent magnitude relations (PL) for the different Cepheid types (Fig. |2~4"1) . To remove systematic outliers that e.g. 
result from blending or the wrong extinction correction^ we additionally perform iterative 3cr clipping (Fig. |2"5]) . But 
in contrast to the 3<r clipping that is used in Section 14.61 for the Period- Wesenheit relation, we use the dispersion of 
the data points relative to the best fit relation (which we calculate for each iteration step) as an error for each Cepheid 
magnitude. Otherwise the small errors would result in the elimination of most of the Cepheids, instead of just clipping 
the systematic outliers. In comparison the use of the dispersion is not necessary for the Period- Wesenheit clipping 
since the errors in the Wesenheit index are large. The slopes, zero points, the numerical fit errors of these values and 
the dispersion of the different relations are given in table [3] for a linear fit of the form: 

m = a- log(P) + b. (15) 

For comparison we also added the period- Wesenheit relation from Fig. [10] and the relations for the manually classified 
Cepheid catalog. The value of the slope a and the values for the intersection b in the usually applied linear approxi- 
mation of the PLR have be en a long standing matter of debate (see Section [T] and literature cited therein) . Recently, 
iFreedman k. Madorel (|2011l ) derived slope values from basic physical principles taking into account the surface temper- 
ature and effective surface area. These slope values depend on wavelength (expressed as observing band dependence) . 



15 N ote that as described in Section l4.2l we only correct for the foreground extinction if there is no color excess given in the Montalto ct al. 
lf2009T ) map. 
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Fig. 23. — Normalized period distribution of the 3-dimensi onal parameter space classified Cepheid catalog. The period distribution of 
the FM Cepheids does show a secondary peak as described in Vilardcll ct al. (2007). 

The predictions of Frcedman & Mad ord (12011I) agree rather well with empirical values of iFouque et al.l (12007D (M ilky 
Way Cepheids) and of INgeow &; Kanburl (20071 ) (LMC) including the wavelength dependence. IBono et al.l (|2010D ob- 
tained slope values from pulsation models, but show an underlying relation with a weak additional curvature term 
(see their Fig. 1). This quadratic behavior i s well approximated by a two-slope lin ear approach with different slope 
for periods shorter and longer than 10 days (INgeow et all 120051 INgeow et al.l l2008h and are in good agreement with 
the broken slope proposed by ISandage et alj (|2009D . I Bono et al.l ()2010h provide their slope pred ictions as a function 
of metalli city and waveband for short, long and overall period range. iDi Criscienzo et ahl <|2012f ) obtain slope values 
similar to Bo no et al.l (|2010f ) from pulsatio n models for differe nt period ranges in the SDSS niters. The obtained slopes 
for Z = 0.02 differ from those obtained by IBono etaLl (|2010l) . 
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Fig. 24. — Period apparent magnitude relations for the 3-dimensional parameter space classified Cepheid catalog. The according fits 
shown as solid lines are given in table [3] Left panel: Period apparent magnitude relation in the rpi band. Right panel: Period apparent 
magnitude relation in the ipi band. 



The slope values we derive are surprisingly shallow. In table |3] we also provide slopes for a subsample restricted to 
Cepheids with periods longer than 10 days. The slopes of the full sample fits in table [3] and those for the subsample 
with P > 10 d yield basically identical values. 

As can be seen in Fig. [21] the shallow slope of the subsample with P > 10 d agrees with the prediction of IBono et al.1 
(2010) for long periods. This is a warning for applying local PLR's to distant galaxies Cepheid samples which have 
even sh allower (absolute) magnitude limit s and are even mo re dominated by long period Cepheids: The slope values 
used bv lRiess et al.l (|2012D for M31 and bv lRiess et al.l (|2011|) for 9 more distant hosts perfectly agree with our findings 
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Fig. 25. — Period-apparent magnitude relations with iterative 3tr clipping. For the error bars in the 3<r clipping we use the dispersion 
of the relation which we calculate for each iteration step. The according fits shown as solid lines are given in tabic [3] Left panel: Period 
apparent magnitude relation in the rpi band. Right panel: Period apparent magnitude relation in the ipi band. 



and with the predictions of IBono et al.l ([2010) for long periods. Thus, the effective period range available for a galaxy 
impacts the derived apparent distance values if the single slope solution for the PL is applied. 



TABLE 3 

Summary of the results of the different period- apparent magnitude relations (PL), period- Wesenheit relations (PLW) and 

THE PERIOD-APPARENT MAGNITUDE RELATIONS WITH ITERATIVE 3(7 CLIPPING. FOR THE ERROR BARS IN THE 3<T CLIPPING (PLC) WE USE 
THE DISPERSION OF THE RELATION WHICH WE CALCULATE FOR EACH ITERATION STEP. ADDITIONALLY A FIT TO A SUBSAMPLE WITH A 

PERIOD RESTRICTION (PLP) FOR FM CEPHEIDS IS PROVIDED. 



Relation 


Catalog 


Type 


Slope [mag/log(d)J 


Zero Point [mag] 


Dispersion [mag] 


PL (rpi band) 


3d 


FM 


-2.39370 ± 0.00110 


22.66600 ± 0.00380 


0.42500 


PLC (r P1 band) 


3d 


FM 


-2.35440 ± 0.00930 


22.73900 ± 0.03390 


0.35010 


PLP (rpi band, P > 10 d) 


3d 


FM 


-2.39060 ± 0.00160 


22.66420 ± 0.00980 


0.55550 


PL (rpi band) 


manual 


FM 


-2.39800 ± 0.00130 


22.63410 ± 0.00490 


0.40700 


PL (rpi band) 


3d 


FO 


-2.56090 ± 0.00370 


22.23350 ± 0.01800 


0.35040 


PLC (r P1 band) 


3d 


FO 


-2.34510 ± 0.02650 


22.18960 ± 0.12520 


0.29360 


PL (rpi band) 


manual 


FO 


-2.46520 ± 0.00680 


22.14830 ± 0.02920 


0.23840 


PL (rpi band) 


3d 


T2 


-2.68030 ± 0.00380 


25.03480 ± 0.03850 


0.38690 


PLC (r P1 band) 


3d 


T2 


-2.27150 ± 0.02850 


24.51130 ± 0.28290 


0.34340 


PL (rpi band) 


manual 


T2 


-2.40420 ± 0.00720 


24.61830 ± 0.06830 


0.30380 


PL (ipi band) 


3d 


FM 


-2.50490 ± 0.00130 


22.61890 ± 0.00470 


0.35710 


PLC (ipi band) 


3d 


FM 


-2.48490 ± 0.00830 


22.66520 ± 0.03000 


0.31090 


PLP (ipi band, P > 10 d) 


3d 


FM 


-2.43540 ± 0.00190 


22.52990 ± 0.01240 


0.43030 


PL (ipi band) 


manual 


FM 


-2.46910 ± 0.00160 


22.55290 ± 0.00610 


0.33780 


PL (ipi band) 


3d 


FO 


-2.84470 ± 0.00490 


22.31370 ± 0.02380 


0.31750 


PLC (ipi band) 


3d 


FO 


-2.64490 ± 0.02290 


22.22470 ± 0.10760 


0.25230 


PL (ipi band) 


manual 


FO 


-2.94140 ± 0.00840 


22.36100 ± 0.04000 


0.24290 


PL (ipi band) 


3d 


T2 


-2.63900 ± 0.00460 


24.81500 ± 0.04680 


0.34530 


PLC (ipi band) 


3d 


T2 


-2.57460 ± 0.02710 


24.72140 ± 0.26840 


0.32680 


PL (ipi band) 


manual 


T2 


-2.68790 ± 0.00830 


24.83940 ± 0.07920 


0.30570 


PLW 


3d 


FM 


-3.00260 ± 0.00620 


22.57140 ± 0.02180 


0.44290 


PLW 


manual 


FM 


-3.00380 ± 0.00760 


22.57660 ± 0.02830 


0.40260 


PLW 


3d 


FO 


-3.57730 ± 0.02210 


22.43020 ± 0.10730 


0.41600 


PLW 


manual 


FO 


-3.59930 ± 0.03830 


22.50680 ± 0.17810 


0.39570 


PLW 


3d 


T2 


-3.12580 ± 0.02140 


24.98560 ± 0.21670 


0.51350 


PLW 


manual 


T2 


-2.89640 ± 0.03900 


24.60340 ± 0.36910 


0.49050 



In a next step we want to check if our sample is dominated by long period Cepheids. If our sample would be dominated 
by the long perio d Cepheids, this would explain why the slopes of our full sample do not agree with the slopes of 
IBono et al.l (|2010D for all Cepheids. Therefore we want to compare the slopes of the short period (P < 10 d) Cepheids 
to those of the long period Cepheids. Simple linear fits to the short period samples provide almost similar slopes to 
those of to the long period samples, but with a different zero points. This means that the fits for the different period 
ranges are not continuous at 10 days. Hence we introduce a different procedure for the fit of the broken slope proposal. 
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Fig. 26. — PLR slope values as function of wavelength. Our full sample an d subsample with P > 10 d yield basically identical slopes 
(color: cyan). They agree with the theoretical predictions of Bono ct al. (2010) for long periods (color: brown). 

We fit two linear slopes (a\ and a 2 ) with a common suspension point (t/10) at 10 days {xw = 10 d): 

x 



y 



y 



01 log( ) 

Xio 



2/io 



X < X 



a 2 log( ) + y w 

xio 



10 



x > XlQ 



(16) 
(17) 



For the FM Cepheids in the rp 1 band this results in 

y w = 20.26755 ± 0.000004 
01 = -2.41618 ±0.00007 
a 2 = -2.37669 ±0.00004 

with a dispersion of 0.36345 for the short periods and a dispersion of 0.55620 for the long periods. This fit is shown 
in Fig. 1271 As can be seen also this fit with a common suspension point does produce similar slopes for the short and 
long period Cepheids. 
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Fig. 27. — Common suspension point fit for FM Cepheids in the rpi band (c.f. equations I16I and I17| l. The resulting slopes for the short 
period and long period subsample are very similar. 



To check if the dispersion of the PLR is concealing a steeper slope we generate a simulated subsample for the short 
period Cepheids (in the rpi band) in such a way that the periods of the Cepheids stay the same, but the magnitude 
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is given by a Gaussian distribution around the slope -2.8. The width of the Gaussian distribution is chosen to be the 
dispersion of the common suspension fit of the short period Cepheids. The subsample of the long period Cepheids stays 
the same. We generate 100000 Gaussian distributions and perform the common suspension point fit. The resulting 
distribution of the short period slopes is shown in Fig. [28] As can be seen the shallow slope of the short period 
subsample cannot be explained by the dispersion of the PLR. 
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Fig. 28. — Distribution of the period subsample slopes for a Gaussian distribution around the slope -2.8 for the FM Cepheids in the rpi 
band. The width of the Gaussian distribution is chosen to be the dispersion of the common suspension fit of the short period Cepheids 
(0.36345). A to tal of 100000 Gaussian distributions was used to determine the distribution of the short slopes. As can be seen the shallow 
slope from Fig. 1271 cannot be explained by the dispersion of the PLR. 

The most likely explanation for the shallow slope at short periods is that we seem to select the brighter Cepheids at 
short periods due to the Malmquist bias and that most short period Cepheids are too faint to be detected, so that we 
cannot determine the slope for the short periods correctly. To confirm this we would need to perform completeness 
tests that are beyond the scope of this work. 

Never theless the good agreem ent of our empir i cal va lues for the long period Cepheids with the predictions of lBono et al.l 
(|2010( ) supports the finding of iSandage et al.l (|2009l ) of a different slope for long and short period Cepheids, depending 
on the observational band, and the more general prediction of a curvature term in the PLR. 

6.5. Spatial Cepheid distribution 

We now investigate the l ocation of the Cepheids in the M31 plane. Fig. [29] shows the position of the Cepheids plotted 
over the E(B-V) map of iMontalto et al.l ([20091 ) . The FM and FO Cepheids are concentrated towards the disk and 
clearly show a ring like structure (left panel in Fig. |29|) . while the older Type II Cepheids trace the halo of M31 (right 
panel in Fig. |2"9"|) . Fig. I3TH shows a contour plot of the distribution of the Cepheids. While we can see the same 
behavior of the FM and FO Cepheids as in Fig. [29] we observe that the Type II distribution is limited by the area of 
the skycells that were analyzed. For the next data release we will increase this area so as to better trace the halo . We 
do not discuss the spatial distribution of Cepheids around NGC 206 but refer the reader to[Magnicr et al.l (|1997t ). 

6.6. Spatial age distribution 

The spatial distribution of FM and FO Cepheids can b e used to obtain a spatial age distribution for M31. We use the 
period-age relation for FM and FO Cepheids given by iBono et al.l (|2005| ) to determine the age of each Cepheid. The 
spatial age distribution shown in Fig. I3T1 indicates that st ar formation in the last ~ 100 Myr was concentrated in a 
ring which is in good agreement with lDavidge et a l. (2012). Fig. l3"T1also hints a correlation between the Cepheid age 
and the distance to the center of M31. 

In order to check for this correlation we deproject the spatial distr ibution. We use 75° for the inclination and 37° for 
the tilt angle and the radius and offset of the 10 kpc ring given bv lGordon et al.l (120061) . The deproje ct ed age map is 
shown in Fig. [32] We found the splitting of the ring as previously described in iGordon et al.l (|2006l ) . iGordon et al.l 
(2006) attributes the star formation of the ring and its splitting to a passage of M32 through the M31 disk. In a next 
step we analyze the age of the Cepheids as a function of distance to the 10 kpc ring. For this analysis we exclude 
all Cepheids in the splitted part of the ring (150 < <p < 260, c.f. right panel in Fig. I3"2"j) . Fig. 1531 shows the median 
Cepheid age as a function of the distance to the 10 kpc ring in bins of 0.5 deg. The first and the last two bins contain 
less than 10 Cepheids and can therefore be neglected. The errors in each bin have been determined with the bootstrap 
method (c.f. Section[3]) and are rather small compared to the dispersion in the bins we consider. The upper left corner 
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Fig . 29. — RA-Dec of the 3-dimensional parameter space classified Cepheid catalog, plotted over the E(B-V) map of Montalto ct al. 
(2009). It can clearly be seen that the FM and FO Cepheids trace the spiral arms, while the Type II Cepheids trace the M31 halo. Left 
panel: FM and FO Cepheid distribution. Right panel: Type II Cepheid distribution. 




RA (J2000.0) 

Fig. 30. — C ontour plot of the RA- Dec distribution for the 3-dimcnsional parameter space classified Cepheid catalog, plotted over the 
E(B-V) map of Mon talto etall l(2009l 1. It can be seen that the FM and FO distribution (cyan dashed lines) follows the spiral arms, while 
the Type II distribution (green solid lines) traces the M31 halo. The Type II distribution is limited by the skycells (red dotted lines) that 
were analyzed until now. The ten contour line levels represent 1% - 10% of the respective Cepheid type number. For the FM and FO 
distribution this number is the total number of FM and FO combined. 

of the upper panel of Fig. [33] contains no Cepheids. In this region (the center of M31) the signal to noise ratio is low, 
so that this is likely the reason that we hardly detect faint Cepheids (i.e. Cepheids with small periods; the study of 
the detection efficiency for Cepheids will be subject of a further work) and thus we hardly detect old Cepheids in this 
region. In the seventh bin we detect old (faint) Cepheids but no young Cepheids, although they should be more easily 
detectable than old Cepheids. Therefore the lack of young Cepheids in the outer region of the ring is no selection 
bias. This leaves us with the conclusion that the age gradient is real. To measure the strength of the age gradient we 
perform a fit to the median age with y = a + b ■ x: 

a = 55.4941 ± 0.7692 [Myr] 
b = 33.8517 ± 0.6100 [Myr/deg] 

with a = 0.36. We therefore conclude that the median age of the stellar population decreases by ~ 34 Myr/deg 
inwards. A possible interpretation for this is that the star formation is related to the interaction with M32 moving 
inwards. 
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Fig. 31. — Spatial age distribution for the 3-d imensional parameter space classified Cepheid catalog for differ ent Cepheid ages, plotted 
over the E(B-V) map of :Montalto et al. (2009). To calculate the age we use the period-age relations from [Bono et all (2005). The 
spatial age distribut ion shows that the star formation in last ~ 100 Myr was concentrated in a ring, which is in good agreement with 
IDavidg c ct al. (2012). Blue points: FM and FO Cepheids with t > 70 Myr; Green upward pointing triangle: FM and FO Ccphcids with 
70 Myr > t > 40 Myr; Red downward pointing triangle: t < 40 Myr. Left panel: RA-Dec distribution. Right panel: RA-Dec distribution 
with the median age for each bin. 
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Fig. 32. — Deprojected spatial age distribut ion fo r the 3-dimensional parameter space classified Cepheid catalog. To calcul ate the age we 
use th e period-age relations from Bono ct al. (200§j). The ring and the splitting of the ring near M3 2 have been discussed in Gord on" et all 
(2006). It can be seen that the star formation is clustered at the 10 kpc ring (Davidgc et al. 2012). Blue points: FM and FO Cepheids 
with t > 70 Myr; Green upward pointing trian gle: FM and FO Ce pheids with 70 Myr > t > 40 Myr; Red downward pointing triangle: 
t < 40 Myr, black solid line: 10 kpc ring from Gordon ct al. (2006) (the center of the ring is the black cross and the magenta point the 
center of M31). Left panel: Deprojected age map. Right panel: Deprojected age map in polar coordinates. 

7. CONCLUSION AND OUTLOOK 

We present a large sample of Cepheids in M31 in the rpi and ipi filters. We develop an automatic Cepheid detection 
and classification scheme based on the Fourier parameters P, Ai\ and </?2i and the location of the instability strip in 
the Wesenheit-color plane. This makes the Cepheid detection and classification less biased than traditional methods 
and facilitates Cepheid detection in large surveys. 

We find 1440 fundamental mode (FM) Cepheids, 126 Cepheids in the first overtone mode (FO), 147 belonging to the 
Populati on II types and 296 C epheids tha t could not be classified. 354 of the 20 09 Cep heids could be found in other 
surveys (jVilardell et all 120071 iFliri et al.l [20061 and the DIRECT project, iMacril l2004f) with a matching radius of 1 
arcsec. The matching tables are provided in electronical form. 

The spatial distribution of FM and FO Cepheids follows the 10 kpc ring in M31, while the Type II Cepheids are 
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Fig. 33. — Age distribution as a function of distance to the 10 kpc ring. Th e definition of the symbols and colors is the same as in Fig. 
1321 The splitted part of the ring (150 < ip < 260, c.f. right panel in Fig. 132 I t was excluded. In the top panel the age distribution for the 
FM and FO Cepheids can be seen. The bottom panel shows the median age for bins with width of 0.5 deg. The first and the last two 
bins contain less than 10 Cepheids and thus can be neglected. The errors of the medians have been determined with the bootstrap method 
(c.f. Section The errors determined with the bootstrap method are shown in magenta with an offset of 0.025 deg for better visibility. 
The dispersion of each bin is shown in black. We observe an age gradient and the fit to the median age (y = a + b ■ x) is shown as a solid 
magenta line. The age gradient suggests that the star formation related to the interaction to M32 moved inwards. No Cepheids can be 
found in the top left corner of the upper panel (the center of M31). We hardly detect old (faint) Cepheids in this region, most likely due to 
the low signal to noise ratio in the center of M31. The lack of young Cepheids in the outer region of the ring is no selection bias, since we 
detect old (faint) Cepheids in the seventh bin, but no young Cepheids, although they should be more easily detectable than old Cepheids. 
This leaves us with the conclusion that the age gradient is real. 

distributed throughout the halo of M31. The spatial age distribution shows a positive age gradient with increasing 
distance from the 10 kpc ring, which implies that the star formation in the 10 kpc ring moved inwards. 
The analysis of our period luminosity relations (PLR) indicates that the PLR is slightly curved. This has ramifications 
on the determination of extra galactic distances, since most of the commonly used calibrations of the PLR are dominated 
by the short period Cepheids and the typical Cepheids that are used for the extra galactic distance determination are 
long period Cepheids. 

The next data release will cover more area of the M31 halo, thus we will be able to trace the halo better. With another 
year of observations we will be able to extend the sample to larger periods. Overlapping skycells will improve the 
calibration and enable completeness tests. 

The complete catalog of the 2009 Cepheids can be found electronically in the CDS database. 
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